OUTLIER DETECTION ON HIGH DIMENSIONAL DATA USING MINIMUM VECTOR VARIANCE (MVV)

نویسندگان

چکیده

High-dimensional data can occur in actual cases where the variable p is larger than number of observations n. The problem that often occurs when adding dimensions indicates points will approach an outlier. Outliers are part do not follow distribution pattern and located far from center. existence outliers needs to be detected because it lead deviations analysis results. One methods used detect Mahalanobis distance. To obtain a robust distance, Minimum Vector Variance (MVV) method used. This study compare MVV with classical distance detecting non-invasive blood glucose level data, both at p>n n>p. test results show better for shows more effective identifying minimum group outlier method.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Outlier Detection on High Dimensional Data Using RNN

Background: Outlier detection is an important factor in data mining since it is used in various real time applications. Outlier is an extreme points that are not related to any of the class. Dealing with dimensions is the great challenge, due to “curse of dimensionality”, for effective outlier detection. In a high dimensional data space, it is difficult to detect most related points and most un...

متن کامل

Outlier Detection for Support Vector Machine using Minimum Covariance Determinant Estimator

The purpose of this paper is to identify the effective points on the performance of one of the important algorithm of data mining namely support vector machine. The final classification decision has been made based on the small portion of data called support vectors. So, existence of the atypical observations in the aforementioned points, will result in deviation from the correct decision. Thus...

متن کامل

Outlier detection for high dimensional data pdf

Is particularly useful for high dimensional data where outliers cannot be found.High dimensional data in Euclidean space pose special challenges to data. In about just the last few years, the task of unsupervised outlier detection has found.Outlier detection is an outstanding data mining task referred to open pdf with mac word class="text" href="https://tokiqivy.files.wordpress.com/2015/06/opel...

متن کامل

Outlier detection for high-dimensional data

Outlier detection is an integral component of statistical modelling and estimation. For highdimensional data, classical methods based on the Mahalanobis distance are usually not applicable. We propose an outlier detection procedure that replaces the classical minimum covariance determinant estimator with a high-breakdown minimum diagonal product estimator. The cut-off value is obtained from the...

متن کامل

Hybrid Approach for Outlier Detection in High Dimensional Data

It has been observed recently that the prominence of multidimensional data is increasing. Existing outlier detection techniques generally fail to work on multi-dimensional data. The need for analyzing high dimensional data has thus increased in today’s data trends. It has enormous application in medical domain, network intrusion and satellite imagery. Even though there are existing methodologie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Barekeng

سال: 2022

ISSN: ['1978-7227', '2615-3017']

DOI: https://doi.org/10.30598/barekengvol16iss3pp797-804